AITopics | relu null

Collaborating Authors

relu null

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

043ab21fc5a1607b381ac3896176dac6-Paper.pdf

Neural Information Processing SystemsFeb-7-2026, 07:28:04 GMT

experiment, precision, relu null, (15 more...)

Neural Information Processing Systems

Country:

Europe > France > Occitanie > Haute-Garonne > Toulouse (0.05)
North America > Canada > Ontario > Toronto (0.04)

Genre: Research Report > New Finding (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.91)

Add feedback

Forward Super-Resolution: How Can GANs Learn Hierarchical Generative Models for Real-World Distributions

Allen-Zhu, Zeyuan, Li, Yuanzhi

arXiv.org Machine LearningJun-4-2021

In practice, by simply training a generator and a discriminator together consisting of multi-layer neural networks with non-linear activation functions, using local search algorithms such as stochastic gradient descent ascent (SGDA), the generator network can be trained efficiently to generate samples from highly-complicated distributions (such as the distribution of images). Despite the great empirical success of GAN, it remains to be one of the least understood models on the theory side of deep learning. Most of existing theories focus on the statistical properties of GANs at the global-optimum [15, 16, 20, 87]. However, on the training side, gradient descent ascent only enjoys efficient convergence to a global optimum when the loss function is convex-concave, or efficient convergence to a critical point in general settings [37, 38, 48, 53, 71, 73, 75, 77, 78]. Due to the extreme non-linearity of the networks in both the generator and the discriminator, it is highly unlikely that the training objective of GANs can be convex-concave. In particular, even if the generator and the discriminator are linear functions over prescribed feature mappings-- such as the neural tangent kernel (NTK) feature mappings [3, 8, 9, 17, 18, 32, 35, 40, 41, 47, 51, 54, 65, 69, 92, 97] -- the training objective can still be non-convex-concave.

poly, relu, relu null, (13 more...)

arXiv.org Machine Learning

2106.02619

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre:

Instructional Material (0.85)
Research Report (0.64)

Industry: Education (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.75)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.67)

Add feedback

Convolutional Composer Classification

Verma, Harsh, Thickstun, John

arXiv.org Machine LearningNov-26-2019

The composer classification question has been posed for a variety of corpora, from Renaissance composers [2,3], to the narrow (and challenging) case of Haydn and Mozart string quartets [5, 8, 12, 22], and to various collections of classical era composers (most of the other papers discussed in Section 2). In this work we study an expansive collection of scores, from 13th century sacred music by Guillaume Du Fay to 20th century ragtimes by Scott Joplin. A major challenge of this task is learning from limited data. While the corpus considered here is larger than most, this is largely due to the number of composers considered (19): for specific composers, we have at most 466 scores (Bach) and as few as 22 (Japart). Small datasets are an inherent problem for composer classification: the corpus used in this work contains, for example, all of the Bach chorales and all of the Mozart string quartets. We cannot resurrect these composers and have them write us more scores to include in our corpus. This situation contrasts starkly with many learning problems, where substantial progress can be made by collecting massive datasets and exhaustively training an expressive model (usually a deep neural network) with "big data."

classification, composer, corpus, (14 more...)

arXiv.org Machine Learning

1911.11737

Country: